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Abstract 

In this paper we consider the reconstruction problem on the tree for 
the hardcore model. We determine new bounds for the non-reconstruction 
regime on the fc-regular tree showing non-reconstruction when 

(ln2-o(l))ln2fe 
^ 21nlnfc 

improving the previous best bound of A < e — 1. This is almost tight as 
reconstruction is known to hold when A > {e + o{l)) In^ k. We discuss the 
relationship for finding large independent sets in sparse random graphs 
and to the mixing time of Markov chains for sampling independent sets 
on trees. 



1 Introduction 

The reconstruction problem on the tree was originally studied as a problem in 
statistical physics but has since found many applications including in computa- 
tional phylogenetic reconstruction [S] , the study of the geometry of the space of 
random constraint satisfaction problems [U [T3] and the mixing time of Markov 
chains [16] . For a Markov model on an infinite tree the reconstruction prob- 
lem asks when do the states at level n provide non-trivial information about the 
state at the root as n goes to infinity. In general the problem involves deter- 
mining the existence of solutions of distribution valued equations and as such 
exact thresholds are known only in a small number of examples [H [TOl O [13] ■ 
In this paper we analyze the reconstruction problem for the hardcore model on 
the fc-regular tree, where each vertex of the tree has degree k. The hardcore 
model is a probability distribution over independent sets / weighted proportion- 
ally to A'^I. Previously Brightwell and Winkler [7] showed that reconstruction 
is possible when A > (e -f o(l)) In^ k. Improving on their bound for the non- 
reconstruction regime, Martin |15j showed that non-reconstruction holds when 
A < e — 1 still leaving a wide gap between the two thresholds. Our main result 
establishes that the bound of Brightwell and Winkler is tight up to a Inlnfc 
multiplicative factor. 
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Theorem 1 The hardcore model on the k-regular tree has non-reconstruction 
when 

(ln2-o(l))ln2fc 

— ; — ; • 

21nliifc 

1.1 The Hardcore Model 

For a finite graph G the independent sets I{G) are subsets of the vertices con- 
taining no adjacent vertices. The hardcore model is a probability measure over 
cr e I{G) C {0, 1}"=^ such that 

p(^) = ^A^^.o'^"l.g,(G) (1) 

where A is the fugacity parameter and Z is a normalizing constant. The defi- 
nition of the hardcore model can be extended to infinite graphs by way of the 
Dobrushin-Lanford-Ruelle condition which essentially says that for every finite 
set A the configuration on A is given by the Gibbs distribution given by a ran- 
dom boundary generated by the measure outside of A. Such a measure is called 
a Gibbs measure and there may be one or infinitely many such measures (see e.g. 
[llj for more details). For every A, there exists a unique translation invariant 
Gibbs measure on the fc-regular tree and it is this measure which we study. 
An alternative equivalent formulation of the hardcore model is as a Markov 
model on the tree. An independent set a is generated by first choosing the root 
according to the distribution 



(tTIjTTo) = 



UJ 1 + CJ 

1 + 2w ' 1 + 2a; 



for some < a; < 1. The states of the remaining vertices of the graph are 
generated from their parents' states by taking one step of the Markov transition 
matrix 



M 
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1 

Ul 1 
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It can easily be checked that tt is reversible with respect to M and that this 
generates a translation invariant Gibbs measure on the tree with fugacity 

X = uj{l + oj)''-\ 

Restating Theorem [l] in terms of u we have non-reconstruction when 
1 r. 



lnfc + lnlnfc-lnlnlnfc-ln2 + lnln2-o(l) =: Q. (2) 



We will introduce some further notation which we will make use of in the proof. 

_ TTq 1+OJ 1 

TToi = — , A = TToi - i = — , 

TTl W U! 

= P00-PW ^Pll -POI = 
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A particularly important role is played by 9, the second eigenvalue of M as is 
discussed in the following subsection. We denote by Py,Ei^ (and resp. Pj,,Ej, 
and Pt,Et) the probability and expectations with respect to the measure ob- 
tained by conditioning on the root p of T to be 1 (resp. 0, and stationary). We 
let L = L{n) denote the vertices at depth n and cr{L) = a{L{n)) denote the 
configuration on level n. We will write PrT['W{L) = A] to denote the measure 
conditioned on the leaves being in state A S {0, 1}^'"^. 

1.2 The reconstruction problem 

The reconstruction problem on the tree essentially asks if we can recover infor- 
mation on the root from the spins deep inside the tree. In particular we say 
that the model has non-reconstruction if 

Pr[(Tp - l\a{L)] ^ TTi (3) 

in probability as n cxi, otherwise the model has reconstruction. Equivalent 
formulations of non-reconstruction arc that the Gibbs measure is extremal or 
that the tail cr-algebra of the Gibbs measure is trivial [2T]. It follows from 
Proposition 12 of 20J that there exists a A/j such that reconstruction holds for 
A > Afl and non-reconstruction holds for A < Xj^. The reconstruction problem 
is to determine the threshold Xr. 

1.3 Related Work 

A significant body of work has been devoted to the reconstruction problem on 
the tree by probabilists, computer scientists and physicists. The earliest such 
result is the Kesten-Stigum bound [T^ which states that reconstruction holds 
whenever 6*^(^ — 1) > 1. This bound was shown to be tight in the case of 
the Ising model [H [T^ where it was shown that non-reconstruction holds when 
6'^{k — 1) < 1. Similar results were derived for the Ising model with small 
external field ^ and the 3-state Potts model which constitute the only 
models for which exact thresholds are known. On the other hand, at least when 
k is large, the Kesten-Stigum bound is known not to be tight for the hardcore 
model [7]. As such, the most one can reasonably ask to show is the asymptotics 
of the reconstruction threshold Xii{k) for large k. 

The Kesten-Stigum bound is known to be the correct bound for robust re- 
construction for all Markov models [12]. Robust reconstruction asks whether 
reconstruction is possible after adding a large amount of noise to the spins in 
level n. It was shown in [T2] that when 0^(fc — 1) < 1 after adding enough noise 
to the spins at level n, the "information" provided by the modified spins at level 
n decays exponentially quickly. 

In both the colouring model and the hardcore model the reconstruction thresh- 
old is far from the Kesten-Stigum bound for large k. In the case of the hardcore 
model 0^(fc — 1) = (l + o(l)^ In^ k. As such, given a noisy version of the spins at 
level n, the information on the root decays rapidly as n grows. In the colouring 
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model close to optimal bounds [3112] were obtained by first showing that, when 
n is small, the information on the root is sufficiently small. Then a quantitative 
version of [12] establishes that the information on the root converges to ex- 
ponentially quickly. The hardcore model behaves similarly. Indeed, the form of 
our bound in equation ([2| is strikingly similar to the bound for the g-coloring 
model which states that reconstruction (resp. non-reconstruction) holds when 
the degree is at least (resp. at most) q\h\q -I- In In g -I- 0(1)]. 
Our proof then proceeds as follows. We first establish that when u) satisfies ([2| 
then even for a tree of depth 3 there is already significant loss of information of 
the spin at the root. In particular we show that if the state of the root is 1 then 
the typical posterior probability that the state of the root is 1 given the spins at 
level 3 will be less than i. The result is completed by linearizing the standard 
tree recursion as in [5] [23]. In this part of the proof we closely follow the notation 
of [5] who analyzed the reconstruction problem for the Ising model with small 
external field. We do not require the full strength of their analysis as in our case 
we are far from the Kesten-Stigum bound. We show that a quantity which we 
refer to as the magnetization decays exponentially fast to 0. The magnetization 
provides a bound on the posterior probabilities and this completes the result. 



Replica Symmetry Breaking and Finding Large Independent Sets 

The reconstruction problem plays a deep role in the geometry of the space 
of solutions of random constraint satisfaction problems. While for problems 
with few constraints the space of solutions is connected and finding solutions 
is generally easy, as the number of constraints increases the space may break 
into exponentially many small clusters. Physicists, using powerful but non- 
rigorous "replica symmetry breaking" heuristics, predicted that the clustering 
phase transition exactly coincides with the reconstruction region on the asso- 
ciated tree model [THl [T3] . This picture was rigorously established (up to first 
order terms) for the colouring and satisfiability problems ^ and further ex- 
tended to sparse random graphs by [19]. As solutions are far apart, local search 
algorithms will in general fail. Indeed for both the colouring and SAT models, 
no algorithm is known to find solutions in the clustered phase. It has been 
conjectured to be computationally intractable beyond this phase transition [T]. 

The associated CSP for the hardcore model corresponds to finding large inde- 
pendent sets in random fc-regular graphs. The replica heuristics again predict 
that the space of large independent sets should be clustered in the reconstruction 
regime. Specifically this refers to independent sets of size sn where s > 7ri(i?), 
the density of I's in the hardcore model at the reconstruction threshold. It is 
known that the largest independent set is with high probability i^niiiiliilL^Tj [g]. 
On the other hand the best known algorithm finds independent sets only of size 
i^^^i^^ll^BJiji which is equal to Tii{R)n [53]. This is consistent with the physics 
predictions and it would be of interest to determine if the space of independent 
sets indeed exhibits the same clustering phenomena as colourings and SAT at 
the reconstruction threshold. Determining the reconstruction threshold more 
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precisely thus has impHcations for the problem of finding large independent sets 
in random graphs. 



Glauber Dynamics on trees 

The reconstruction threshold plays a key role in the study of the rate of conver- 
gence of the Glauber dynamics markov chain for sampling spin systems on trees. 
This problem has received considerable attention (see e.g. [2^ i3j JJl HZl HI] ) and 
in the case of the Ising model, the mixing time is known to undergo a phase 
transition from f?(nlnn) in the non-reconstruction regime to n^"*"^'^-* in the re- 
construction regime [2]. In fact, the mixing time is for any spin system 
above the reconstruction threshold. A similar transition was shown to take 
place for the colouring model [23]. Sharp bounds of this type are not known 
from the hardcore model, however, it is predicted that the Glauber dynamics 
should again be 0(n log n) in the non-reconstruction regime. 



2 Proof of Theorem [1] 

It is simple to show that non-reconstruction on the /c-regular tree is equivalent to 
non-reconstruction on the (fc — l)-regular tree. For ease of notation we establish 
our bounds for the k-aiy tree noting that in equation ^ we have that Li){k + 
1) — uj{k) — o{k) so the difference can be absorbed in the error term. Let T 
denote the infinite fc-ary tree and let r„ denote the restriction of T to its first 
n levels. 

Before reading further, it might help the reader to quickly recall the notation 
from the end of Section As in [5] we analyse a random variable X which 
denotes weighted magnetization of the root which is a function of the leaf states 
of the tree. We define X — X{n) on T„ by 



X = na'[noV{ap = l\A)~niV{ap^O\A)] 
1 



(4) 



Since ET[P[crp = 1\A]] — V[ap = 1] = tti, from the above expression, we have 
that E[X] = 0. Also, X <1 since P[crp = 1|A] < 1. We will make extensive use 
of the following second moments of the magnetization. 

X = Et[x^], Xi = [a:% Xo^K [x'^] 

With these definitions in hand, by the definition in ([S]) we can characterize 
non-reconstruction as follows. 

Proposition 2.1 Non-reconstruction for the model {T,M) is equivalent to 

lim X(n) = 0, 

where X{n) = Et„ [X'^] ■ 
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In the remainder of the proof we derive bounds for X. We begin by showing that 
already for a 3 level tree, X becomes small. Then we establish a recurrence along 
the lines of [5] that shows that once X is sufficiently small, it must converge to 
0. As this part of the derivation follows the calculation in [5] we will adopt their 
notation in places. Non-reconstruction is then a consequence of Proposition [O] 
In the next lemma we determine some basic properties of X. 

Lemma 2.2 The following relations hold: 

a) Et[X] = 7riEi,[X] + ttqEO [X] = 0. 

b) X = 7riXi+7roXo. 

c) E^[X] = TToiX and E°.[X] = -X. 

Proof: Note that for any random variable which depends only on the states 
at the leaves, / = f{A), we have Exif] — 7riEy[/] + 7roE^[/]. Parts a) and b) 
therefore follow since X is a random variable that is a function of the states at 
the leaves. For part c) we proceed as follows. The first and last equalities below 
follow from Q. 



TTi \ TTi 

_i /ET[(PT[ap = l\A]y^ 



A 

- "01 ^ ^2 

The second part of c) follows by combining this with a). □ 
The following proposition estimates typical posterior probabilities which we will 
use to bound X. For a finite tree T let T* be the subtrees rooted at the children 
of the root Ui. 

Proposition 2.3 For a finite tree T we have that 

a) For any configuration at the leaves A — {Ai, • • • , Ak), 

Pt^p = 0|f7L = A]=(l + aJJPt.K, - OlcTL, - Ai 

i 

h) Let A he the set of leaf configurations 

A = \a{L) I P[ap = 0|a(L)] = \{l ^ 



2 V 1 + 2A 
Then 

Pg.[g(L) € A] _ TTi 1 + A 
P;^[cr(L) € ^] ~ TTo A 
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c) Let P >\n2-\n\n.2 and uj = ^ [in fc + Inln A; - Inlnln fc - /?] . Then in the 3 
level k-ary tree T3 we have that 

E}r^[V[a, = l\am < ^. 

Proof: Part a) is a consequence of standard tree recursions for Markov models 
established using Bayes rule. 
For part b) first note that 



'[ap = 1 I ct(L) eA] = 1 - V[(Tp = I (t(L) e A] 

_ I ( I 

^ 2 V " 1 + 2A 



Now, 



K[o{L)eA] ^ - I -jL) e AnajL) e A] 

TTi 1 + A /P[CTp = 1 I (t(L) e A]V[(7{L) e A] 

TTo A V 

= "^'-^KWWeA] 

TTo A 

where the first and third equations follow by definition of conditional probabil- 
ities and the second follows from ^ which establishes 6). 

For part c), we start by calculating the probability of certain posterior proba- 
bilities for trees of small depth. Note that with our assumption on to we have 
that 

e^'^ln k 

By part a), since o'(L) = 1 under we have that 

Pi^jap = 0|a(i)] = ^ w.p. 1. 



Also, 

PTlK = OV^|ap = 0)= 1^- 
Using the two equations above, we have that 



1 



- OJ 



, 1 w.p. 1 — ( TX 

Applying part a) to a tree of depth 2, we have 
Pi^jap = 0|a(i)] = 



l + \Yl,PlK.=0\a{L)] 
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Therefore 



^^J-P = OKL)] = ^ 1(1 + ^) .... (6) 



1 

l+A 

fc-l / \ k 



1+2A 



o.w. 



By part b) with ^ as defined, and ^ we have that after substituting the ex- 
pressions for A and us, 

TTo A 



X{l+uj)\ \l + iuj I \l+uj 
, e'^ Inlnfc 

> ^ (7) 

We can now calculate the values of Pj-^Wp — 0|o-(i)] as follows. By part a) 

Denote ^ ^ 

^ = xItt^ " (tt^) ) (tt^) ^ 

By Chernoff bounds, and the bound on p from ([t]), 

¥{Bin{k,p) < e'' Inlnfc- 2\/e'3 Inlnfc) < -. 

3 

Finally, by the definition of A, 
and hence, 

Ei,3[P[ap = lKL)]] < ( 1 



2 1 
3+ 3 



1 + A[2(l - Ofe(l))]-("' lnl„fe-2Ve^* Inlnfc) 

By taking fc large enough above, we conclude that for /? and large enough k 



□ 
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Figure 1: A finite tree T 



Lemma 2.4 Let /3 > In 2 — In In 2 and lj = ^ [ In fc + In In fc — In In In fc — /?] . For 
k large enough, 

Xi3) < ^. 



Proof: By part c) of Lemma 2.2 and part c) of Proposition 2.3 
X(3) = -. 



Ei,JP[a, = l I aim 



< 



< 



27ri 



- 1 



Next, we present a recursion for X and complete the proof of the main result. 
The developement of the recursion follows the steps in [5 closely so we follow 
their notation and omit some of the calculations in this short version. 



Magnetisation of a child 

With T and x as defined previously, let y be a child of a; and let T' be the 
subtree of T rooted at y (see Figure 1). Let A' be the restriction of A to the 
leaves of T' . Let Y = Y{A') denote the magnetization of y. 

Lemma 2.5 We have 

a) E\,[Y] = eE\„[Y] and E^[Y] = 9E'^,[Y]. 

b) E\.[Y^] = {1-0)Et'[Y^]+9E\.,[Y^]. 

c) E9^[Y'^] = {l-e)ET'[Y^']+eE9^,[Y\ 
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Figure 2: The tree T after obtained after raerging T' and T" . The dashed 
subtree is T. 

The proof follows from the first part of Lemma |2.2| and the Markov property 
when we condition on x. 

Next, we can write the effect on the magnetization of adding an edge to the 
root and merging roots of two trees as follows. Referring to Figure 2, let T' 
(resp. T") be a finite tree rooted at y (resp. z) with the channel on all edges 
being given M, leaf states A (resp A") and weighted magnetisation at the root 
Y (resp. Z). Now add an edge {y, z) to T" to obtain a new tree T. Then merge 
T with T' by identifying y = y to obtain a new tree T. To avoid ambiguities, 
denote by x the root of T and X the magnetization of the root of T. We let 
A = {A' , A") be the leaf state of T. Let Y be the magnetization of the root of T. 

Note: In the above construction, the vertex y is a vertex "at the same level" 
as X, and not a child of x as it was in Lemma 1 2. 5 1 

Lemma 2.6 With the notation above, Y = 9Z. 

The proof follows by applying Bayes rule, the Markov property and Lemma |2.2[ 
These facts also imply that 

Lemma 2.7 For any tree T, 

^ _ Y + Y + AYY 
l + TToiYY 

With these lemmas in hand we can use the following relation to derive a recursive 
upper bound on the second moments. We will use the expansion 



10 



1 

1 + r 



Taking r = ttqiYY, by Lemma 
X = {Y + Y + AYY) 



2.7 



1 - r 



we have 



1 - TToiYY + (TTo.YYy^ 



l+TTQlYY\ 

Y + Y + AYY - noiYY (y + Y + AYy) + {TroifiYY)^X 



< Y + Y + AYY - TToiYY {Y + Y + AYY + (7roi)^(rr) 



(8) 



where the last inequahty foUows since X < I with probabihty 1. 
Let p' — Yi/Y and p" — Zi/Z. Below, the moments Y etc. are defined 
according to the appropriate measures over the tree rooted at y (i.e. T') etc. 
By applying Lemmas 2.2 2.5 and 2.6 we have the following relations. 



E].[X] = TToiX, E^Y] = TToiV, E^t[Y^] = Yp' 

E^[Y] = TToiO^Z, E^j,[Y^] = e^Z{{\ -6) + Op") (9) 

Applying (7roi)^^-E'^[-] to both sides of ([s]), we obtain the following. 

X < Y + 9^Z + ATToiYZ-TTQie^YZp'-TTaie^YZ{{l-e)+ep") 
-Ae^YZp'iil -6) + Op") + 7roi02y^p'((l -9)+ 9p") 
= Y + e'^'Z -'Koie'^YZ[A- AB] 



where 



A = p' + {l-p')[{l-. 



LjJ 



and B = l-{nai)-^p'[{l-e) + ep"] = l- -—p'[{\-e) + ep"]. 

1 + UJ 

liA — AB > 0, this would already give a sufficiently good recursion to show that 
X{n) goes to 0, so we will assume is negative and try to get a good (negative) 
lower bound. First note that by their definition > 0. Further since 

F= TTlFl +7roFo, 

1 + 2^ 

P < (tti) = . 

UJ 

Similarly, 

P" < (..)- - 



Since E\,[Y^] and Z > 0, it follows from (|9| that (1-6) + Op" > 0. Together 
with the fact that p' > 0, this implies that B < I. 
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Since A is multi-linear in {p',p"), to minimize it, its sufficient to consider the 
extreme cases. When p' = 0, A is minimized at the upper bound of p" and 
hence 

-4> l-TToi-^ =0. 

1 + UJ 

When p' ^ {tti)-\ 

A - (^i)-i + (1 - (7ri)-i)[l - 9(1 - p")] > 0. 

Hence, we have 

X <Y + e^z+ -^—YZ. 

~ l+UJ 

Applying this recursively to the tree, we obtain the following recursion for the 
moments. 



X < {l + uj)9^ 
We bound the (1 + x)'' — 1 term as. 



Z 



- 1 



~\x\k 

(1 + - 1| < el^l'^' - 1 = / ds< el^l'=fc|x| 







and this implies the following recursion. 

Theorem 2.8 If for some n, X{n) < ^ , we have that 

X{n + 1) < uj^e^'^'^kXin). 
Thus \i oj'^e^'^^k < 1 then it follows from the recursion that 

limX(n) = 0. (10) 

n 

When Lo — ^ [ In fc + In In A; — In In In fc — /3] and /? > In 2 — In In 2, by Lemma 



2.4 



for fc large enough, X{i) < Hence by equation (10 1 we have that X{n) — > 
and so by Proposition |2.1| we have non-reconstruction. Since reconstruction is 
monotone in A and hence in uj it follows that we have non-reconstruction for 
Lu < ui for large fc. This completes the proof of Theorem [l] 
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